Jinsong Lan

iTryOn: Mastering Interactive Video Virtual Try-On with Spatial-Semantic Guidance

May 20, 2026
Via arXiv

Improving Human Image Animation via Semantic Representation Alignment

May 11, 2026
Via arXiv

Continuous-Time Distribution Matching for Few-Step Diffusion Distillation

May 07, 2026
Via arXiv

Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items

Apr 22, 2026
Via arXiv

Cross-modal Identity Mapping: Minimizing Information Loss in Modality Conversion via Reinforcement Learning

Mar 02, 2026
Via arXiv

Pailitao-VL: Unified Embedding and Reranker for Real-Time Multi-Modal Industrial Search

Feb 14, 2026
Via arXiv

REVISION: Reflective Intent Mining and Online Reasoning Auxiliary for E-commerce Visual Search System Optimization

Oct 26, 2025
Via arXiv

MMKB-RAG: A Multi-Modal Knowledge-Based Retrieval-Augmented Generation Framework

Apr 15, 2025
Via arXiv

Squeeze Out Tokens from Sample for Finer-Grained Data Governance

Mar 18, 2025
Via arXiv

Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training

Nov 30, 2024
Via arXiv